Speeding up Warehouse Physical Design Using a Randomized Algorithm
نویسندگان
چکیده
A data warehouse stores information that is collected from multiple, heterogeneous information sources for the purpose of complex querying and analysis. Information in the warehouse is typically stored in the form of materialized views. One of the most important tasks when designing a warehouse is the selection of materialized views to be maintained in the warehouse. The goal is to select a set of views in such a way as to minimize the total query response time over all queries, given a limited amount of storage space and time for maintaining the views (view selection problem). The paper focuses on an efficient solution to the view selection problem using a genetic algorithm for computing a near-optimal set of views. Specifically, we explore the view selection problem in the context of OR view graphs. We show that our approach represents a dramatic improvement in time complexity over existing search-based approaches using heuristics. Our analysis shows that the algorithm consistently yields a solution that lies within 10% of the optimal query benefit while at the same time exhibiting only a linear increase in execution time. We have implemented a prototype version of our algorithm which is used to simulate the measurements used in the analysis of our approach.
منابع مشابه
Speeding Up Materialized View Selection in Data Warehouses Using a Randomized Algorithm
A data warehouse stores information that is collected from multiple, heterogeneous information sources for the purpose of complex querying and analysis. Information in the warehouse is typically stored in the form of materialized views, which represent pre-computed portions of frequently asked queries. One of the most important tasks when designing a warehouse is the selection of materialized v...
متن کاملSpeeding up the Stress Analysis of Hollow Circular FGM Cylinders by Parallel Finite Element Method
In this article, a parallel computer program is implemented, based on Finite Element Method, to speed up the analysis of hollow circular cylinders, made from Functionally Graded Materials (FGMs). FGMs are inhomogeneous materials, which their composition gradually varies over volume. In parallel processing, an algorithm is first divided to independent tasks, which may use individual or shared da...
متن کاملTwo-warehouse system for non-instantaneous deterioration products with promotional effort and inflation over a finite time horizon
In the current global market, organizations use many promotional tools to increase their sales. One such tool is sales teams’ initiatives or promotional policies, i.e., free gifts, discounts, packaging, etc. This phenomenon motivates the retailer/or buyer to order a large inventory lot so as to take full benefit of promotional policies. In view of this the present paper considers a two-warehous...
متن کاملUsing Materialized Views To Speed Up Data Warehousing
Running analytical queries directly against the huge raw data volume of a data warehouse results in unacceptable query performance. The solution to this problem is storing materialized views in the warehouse, which pre-aggregate the data and thus avoid raw data access and speed up queries. In this paper, at first the problems concerning the selection of the right pre-aggregations and their util...
متن کاملSpeeding Up Materialized-View Maintenance Using Cheap Filters at the Warehouse
We consider the problem of speeding up the incremental maintenance of materialized views defined by conjunctive queries (CQ) over external base relations, when querying these base relations is expensive. Our approach consists of detecting, without using the base relations, situations where a view either is not affected by a base update (VDU) or can be maintained using only the views in the ware...
متن کامل